Overview

Dataset statistics

Number of variables52
Number of observations11573
Missing cells328835
Missing cells (%)54.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.6 MiB
Average record size in memory2.1 KiB

Variable types

CAT45
NUM7

Reproduction

Analysis started2020-04-28 23:10:12.943292
Analysis finished2020-04-28 23:11:21.227991
Versionpandas-profiling v2.6.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
saledate has a high cardinality: 94 distinct values High cardinality
fiModelDesc has a high cardinality: 1731 distinct values High cardinality
fiBaseModel has a high cardinality: 786 distinct values High cardinality
fiSecondaryDesc has a high cardinality: 81 distinct values High cardinality
fiModelSeries has a high cardinality: 59 distinct values High cardinality
fiModelDescriptor has a high cardinality: 73 distinct values High cardinality
fiProductClassDesc has a high cardinality: 67 distinct values High cardinality
ProductGroupDesc is highly correlated with ProductGroupHigh Correlation
ProductGroup is highly correlated with ProductGroupDescHigh Correlation
MachineHoursCurrentMeter has 6834 (59.1%) missing values Missing
UsageBand has 7542 (65.2%) missing values Missing
fiSecondaryDesc has 3536 (30.6%) missing values Missing
fiModelSeries has 9814 (84.8%) missing values Missing
fiModelDescriptor has 8676 (75.0%) missing values Missing
ProductSize has 5830 (50.4%) missing values Missing
Drive_System has 8847 (76.4%) missing values Missing
Forks has 5935 (51.3%) missing values Missing
Pad_Type has 9611 (83.0%) missing values Missing
Ride_Control has 7451 (64.4%) missing values Missing
Stick has 9611 (83.0%) missing values Missing
Transmission has 6796 (58.7%) missing values Missing
Turbocharged has 9611 (83.0%) missing values Missing
Blade_Extension has 10809 (93.4%) missing values Missing
Blade_Width has 10809 (93.4%) missing values Missing
Enclosure_Type has 10809 (93.4%) missing values Missing
Engine_Horsepower has 10809 (93.4%) missing values Missing
Hydraulics has 2010 (17.4%) missing values Missing
Pushblock has 10809 (93.4%) missing values Missing
Ripper has 8765 (75.7%) missing values Missing
Scarifier has 10809 (93.4%) missing values Missing
Tip_Control has 10809 (93.4%) missing values Missing
Tire_Size has 8653 (74.8%) missing values Missing
Coupler has 4846 (41.9%) missing values Missing
Coupler_System has 10057 (86.9%) missing values Missing
Grouser_Tracks has 10060 (86.9%) missing values Missing
Hydraulics_Flow has 10060 (86.9%) missing values Missing
Track_Type has 8533 (73.7%) missing values Missing
Undercarriage_Pad_Width has 8529 (73.7%) missing values Missing
Stick_Length has 8530 (73.7%) missing values Missing
Thumb has 8529 (73.7%) missing values Missing
Pattern_Changer has 8530 (73.7%) missing values Missing
Grouser_Type has 8533 (73.7%) missing values Missing
Backhoe_Mounting has 9533 (82.4%) missing values Missing
Blade_Type has 9531 (82.4%) missing values Missing
Travel_Controls has 9530 (82.3%) missing values Missing
Differential_Type has 9420 (81.4%) missing values Missing
Steering_Controls has 9420 (81.4%) missing values Missing
auctioneerID has 129 (1.1%) zeros Zeros
MachineHoursCurrentMeter has 708 (6.1%) zeros Zeros

Variables

SalesID
Real number (ℝ≥0)

UNIQUE
Distinct count11573
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5180809.187246176
Minimum1222837
Maximum6333349
Zeros0
Zeros (%)0.0%
Memory size90.5 KiB

Quantile statistics

Minimum1222837
5-th percentile1225118.6
Q14312616
median6264848
Q36286342
95-th percentile6317178.8
Maximum6333349
Range5110512
Interquartile range (IQR)1973726

Descriptive statistics

Standard deviation1619443.493
Coefficient of variation (CV)0.3125850489
Kurtosis0.9371250427
Mean5180809.187
Median Absolute Deviation (MAD)1340449.682
Skewness-1.401197889
Sum5.995750472e+10
Variance2.622597228e+12
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1222837. 1224593. 1224709.5 1224736.5 1224871.5 ... 6324810. 6327455.5 6328205.5 6333200. 6333349. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
6271032 1 < 0.1%
 
4300129 1 < 0.1%
 
6274397 1 < 0.1%
 
4312411 1 < 0.1%
 
6266201 1 < 0.1%
 
6303694 1 < 0.1%
 
6301014 1 < 0.1%
 
6258005 1 < 0.1%
 
1226068 1 < 0.1%
 
4261202 1 < 0.1%
 
Other values (11563) 11563 99.9%
 
ValueCountFrequency (%) 
1222837 1 < 0.1%
 
1222839 1 < 0.1%
 
1222841 1 < 0.1%
 
1222843 1 < 0.1%
 
1222845 1 < 0.1%
 
ValueCountFrequency (%) 
6333349 1 < 0.1%
 
6333348 1 < 0.1%
 
6333347 1 < 0.1%
 
6333345 1 < 0.1%
 
6333344 1 < 0.1%
 

MachineID
Real number (ℝ≥0)

Distinct count9681
Unique (%)83.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1651494.6219649184
Minimum150
Maximum2485252
Zeros0
Zeros (%)0.0%
Memory size90.5 KiB

Quantile statistics

Minimum150
5-th percentile214793.4
Q11067304
median1862151
Q32270530
95-th percentile2304793.8
Maximum2485252
Range2485102
Interquartile range (IQR)1203226

Descriptive statistics

Standard deviation652248.5331
Coefficient of variation (CV)0.3949443882
Kurtosis-0.03515233719
Mean1651494.622
Median Absolute Deviation (MAD)532712.0637
Skewness-1.037874401
Sum1.911274726e+10
Variance4.25428149e+11
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.5000000e+02 7.3879000e+04 1.2497350e+05 1.6526550e+05 1.8422000e+05 ... 2.3135425e+06 2.3135830e+06 2.3138090e+06 2.4841210e+06 2.4852520e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2283592 22 0.2%
 
2285830 19 0.2%
 
1896854 18 0.2%
 
1746392 15 0.1%
 
2300370 14 0.1%
 
2293171 14 0.1%
 
2268800 13 0.1%
 
2282547 13 0.1%
 
2297316 12 0.1%
 
2313570 12 0.1%
 
Other values (9671) 11421 98.7%
 
ValueCountFrequency (%) 
150 1 < 0.1%
 
257 1 < 0.1%
 
706 1 < 0.1%
 
1012 1 < 0.1%
 
1488 1 < 0.1%
 
ValueCountFrequency (%) 
2485252 1 < 0.1%
 
2484623 1 < 0.1%
 
2484122 1 < 0.1%
 
2484120 1 < 0.1%
 
2478468 1 < 0.1%
 

ModelID
Real number (ℝ≥0)

Distinct count1763
Unique (%)15.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8940.13583340534
Minimum28
Maximum37197
Zeros0
Zeros (%)0.0%
Memory size90.5 KiB

Quantile statistics

Minimum28
5-th percentile1083.4
Q13362
median4763
Q314303
95-th percentile23931
Maximum37197
Range37169
Interquartile range (IQR)10941

Descriptive statistics

Standard deviation7807.393696
Coefficient of variation (CV)0.873296988
Kurtosis0.2842979286
Mean8940.135833
Median Absolute Deviation (MAD)6559.855447
Skewness1.055996828
Sum103464192
Variance60955396.32
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.80000e+01 3.00000e+01 5.00000e+01 6.40000e+01 7.60000e+01 ... 3.59390e+04 3.59975e+04 3.61320e+04 3.63095e+04 3.71970e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
4605 309 2.7%
 
3542 116 1.0%
 
3538 107 0.9%
 
3362 103 0.9%
 
13247 91 0.8%
 
23931 79 0.7%
 
1169 73 0.6%
 
9580 71 0.6%
 
22072 65 0.6%
 
4607 65 0.6%
 
Other values (1753) 10494 90.7%
 
ValueCountFrequency (%) 
28 6 0.1%
 
29 8 0.1%
 
31 4 < 0.1%
 
34 3 < 0.1%
 
43 9 0.1%
 
ValueCountFrequency (%) 
37197 2 < 0.1%
 
36932 1 < 0.1%
 
36894 1 < 0.1%
 
36879 1 < 0.1%
 
36779 2 < 0.1%
 

datasource
Real number (ℝ≥0)

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean152.62265618249373
Minimum121
Maximum173
Zeros0
Zeros (%)0.0%
Memory size90.5 KiB

Quantile statistics

Minimum121
5-th percentile121
Q1149
median149
Q3172
95-th percentile172
Maximum173
Range52
Interquartile range (IQR)23

Descriptive statistics

Standard deviation14.87206399
Coefficient of variation (CV)0.09744335709
Kurtosis-0.02259949746
Mean152.6226562
Median Absolute Deviation (MAD)11.10451432
Skewness-0.3873764293
Sum1766302
Variance221.1782872
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[121. 140.5 160.5 172.5 173. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
149 7021 60.7%
 
172 3315 28.6%
 
121 1212 10.5%
 
132 24 0.2%
 
173 1 < 0.1%
 
ValueCountFrequency (%) 
121 1212 10.5%
 
132 24 0.2%
 
149 7021 60.7%
 
172 3315 28.6%
 
173 1 < 0.1%
 
ValueCountFrequency (%) 
173 1 < 0.1%
 
172 3315 28.6%
 
149 7021 60.7%
 
132 24 0.2%
 
121 1212 10.5%
 

auctioneerID
Real number (ℝ≥0)

ZEROS
Distinct count14
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.547481206255941
Minimum0
Maximum99
Zeros129
Zeros (%)1.1%
Memory size90.5 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q33
95-th percentile99
Maximum99
Range99
Interquartile range (IQR)2

Descriptive statistics

Standard deviation22.30707735
Coefficient of variation (CV)2.955565803
Kurtosis12.54377359
Mean7.547481206
Median Absolute Deviation (MAD)10.77200996
Skewness3.774991416
Sum87347
Variance497.6056999
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 11. 12.5 15.5 62.5 99. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 7463 64.5%
 
3 1212 10.5%
 
2 1001 8.6%
 
99 636 5.5%
 
4 403 3.5%
 
8 216 1.9%
 
12 192 1.7%
 
0 129 1.1%
 
26 120 1.0%
 
10 86 0.7%
 
Other values (4) 115 1.0%
 
ValueCountFrequency (%) 
0 129 1.1%
 
1 7463 64.5%
 
2 1001 8.6%
 
3 1212 10.5%
 
4 403 3.5%
 
ValueCountFrequency (%) 
99 636 5.5%
 
26 120 1.0%
 
16 15 0.1%
 
15 62 0.5%
 
13 37 0.3%
 

YearMade
Real number (ℝ≥0)

Distinct count55
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1895.3318067916703
Minimum1000
Maximum2014
Zeros0
Zeros (%)0.0%
Memory size90.5 KiB

Quantile statistics

Minimum1000
5-th percentile1000
Q11993
median2001
Q32005
95-th percentile2007
Maximum2014
Range1014
Interquartile range (IQR)12

Descriptive statistics

Standard deviation305.4819013
Coefficient of variation (CV)0.1611759483
Kurtosis4.705841297
Mean1895.331807
Median Absolute Deviation (MAD)186.6016001
Skewness-2.588151402
Sum21934675
Variance93319.192
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1000. 1476.5 1955. 1962.5 1966.5 ... 2006.5 2007.5 2008.5 2010.5 2014. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2005 1509 13.0%
 
2006 1211 10.5%
 
1000 1206 10.4%
 
2004 894 7.7%
 
1998 530 4.6%
 
2007 525 4.5%
 
1999 507 4.4%
 
2003 500 4.3%
 
2000 496 4.3%
 
2001 447 3.9%
 
Other values (45) 3748 32.4%
 
ValueCountFrequency (%) 
1000 1206 10.4%
 
1953 1 < 0.1%
 
1957 1 < 0.1%
 
1958 3 < 0.1%
 
1962 2 < 0.1%
 
ValueCountFrequency (%) 
2014 2 < 0.1%
 
2011 13 0.1%
 
2010 33 0.3%
 
2009 44 0.4%
 
2008 269 2.3%
 

MachineHoursCurrentMeter
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count3075
Unique (%)64.9%
Missing6834
Missing (%)59.1%
Infinite0
Infinite (%)0.0%
Mean5482.141380037982
Minimum0.0
Maximum89200.0
Zeros708
Zeros (%)6.1%
Memory size90.5 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11268
median3786
Q37793
95-th percentile16351.7
Maximum89200
Range89200
Interquartile range (IQR)6525

Descriptive statistics

Standard deviation6391.097182
Coefficient of variation (CV)1.165803057
Kurtosis23.65812955
Mean5482.14138
Median Absolute Deviation (MAD)4356.48669
Skewness3.373997252
Sum25979868
Variance40846123.19
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 708 6.1%
 
24 74 0.6%
 
48 43 0.4%
 
1553 5 < 0.1%
 
1916 5 < 0.1%
 
3311 4 < 0.1%
 
9070 4 < 0.1%
 
9192 4 < 0.1%
 
828 4 < 0.1%
 
7798 4 < 0.1%
 
Other values (3065) 3884 33.6%
 
(Missing) 6834 59.1%
 
ValueCountFrequency (%) 
0 708 6.1%
 
1 2 < 0.1%
 
11 2 < 0.1%
 
15 1 < 0.1%
 
18 1 < 0.1%
 
ValueCountFrequency (%) 
89200 2 < 0.1%
 
73860 1 < 0.1%
 
59431 1 < 0.1%
 
55214 1 < 0.1%
 
53839 1 < 0.1%
 

UsageBand
Categorical

MISSING
Distinct count3
Unique (%)0.1%
Missing7542
Missing (%)65.2%
Memory size90.5 KiB
Medium
1847
Low
1691
High
493
ValueCountFrequency (%) 
Medium 1847 16.0%
 
Low 1691 14.6%
 
High 493 4.3%
 
(Missing) 7542 65.2%
 

Length

Max length6
Mean length3.521385985
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 78.6%
 
Uppercase_Letter 3 21.4%
 
ValueCountFrequency (%) 
Latin 14 100.0%
 
ValueCountFrequency (%) 
ASCII 14 100.0%
 

saledate
Categorical

HIGH CARDINALITY
Distinct count94
Unique (%)0.8%
Missing0
Missing (%)0.0%
Memory size90.5 KiB
2/13/2012 0:00
1598
3/29/2012 0:00
 
765
2/12/2012 0:00
 
698
3/28/2012 0:00
 
584
1/28/2012 0:00
 
510
Other values (89)
7418
ValueCountFrequency (%) 
2/13/2012 0:00 1598 13.8%
 
3/29/2012 0:00 765 6.6%
 
2/12/2012 0:00 698 6.0%
 
3/28/2012 0:00 584 5.0%
 
1/28/2012 0:00 510 4.4%
 
3/22/2012 0:00 384 3.3%
 
2/6/2012 0:00 381 3.3%
 
3/8/2012 0:00 353 3.1%
 
3/26/2012 0:00 321 2.8%
 
3/14/2012 0:00 288 2.5%
 
Other values (84) 5691 49.2%
 

Length

Max length14
Mean length13.8146548
Min length13
ValueCountFrequency (%) 
Decimal_Number 10 76.9%
 
Other_Punctuation 2 15.4%
 
Space_Separator 1 7.7%
 
ValueCountFrequency (%) 
Common 13 100.0%
 
ValueCountFrequency (%) 
ASCII 13 100.0%
 

fiModelDesc
Categorical

HIGH CARDINALITY
Distinct count1731
Unique (%)15.0%
Missing0
Missing (%)0.0%
Memory size90.5 KiB
310G
 
309
420D
 
116
416C
 
107
140G
 
103
580MII
 
91
Other values (1726)
10847
ValueCountFrequency (%) 
310G 309 2.7%
 
420D 116 1.0%
 
416C 107 0.9%
 
140G 103 0.9%
 
580MII 91 0.8%
 
140HNA 79 0.7%
 
320CL 73 0.6%
 
T190 71 0.6%
 
310SG 65 0.6%
 
D6NLGP 65 0.6%
 
Other values (1721) 10494 90.7%
 

Length

Max length16
Mean length4.990149486
Min length2
ValueCountFrequency (%) 
Uppercase_Letter 26 66.7%
 
Decimal_Number 10 25.6%
 
Other_Punctuation 1 2.6%
 
Space_Separator 1 2.6%
 
Dash_Punctuation 1 2.6%
 
ValueCountFrequency (%) 
Latin 26 66.7%
 
Common 13 33.3%
 
ValueCountFrequency (%) 
ASCII 39 100.0%
 

fiBaseModel
Categorical

HIGH CARDINALITY
Distinct count786
Unique (%)6.8%
Missing0
Missing (%)0.0%
Memory size90.5 KiB
310
 
532
D6
 
417
580
 
381
D5
 
294
416
 
213
Other values (781)
9736
ValueCountFrequency (%) 
310 532 4.6%
 
D6 417 3.6%
 
580 381 3.3%
 
D5 294 2.5%
 
416 213 1.8%
 
950 199 1.7%
 
140 185 1.6%
 
650 170 1.5%
 
D8 170 1.5%
 
12 158 1.4%
 
Other values (776) 8854 76.5%
 

Length

Max length10
Mean length3.299403785
Min length2
ValueCountFrequency (%) 
Uppercase_Letter 24 66.7%
 
Decimal_Number 10 27.8%
 
Other_Punctuation 1 2.8%
 
Space_Separator 1 2.8%
 
ValueCountFrequency (%) 
Latin 24 66.7%
 
Common 12 33.3%
 
ValueCountFrequency (%) 
ASCII 36 100.0%
 

fiSecondaryDesc
Categorical

HIGH CARDINALITY
MISSING
Distinct count81
Unique (%)1.0%
Missing3536
Missing (%)30.6%
Memory size90.5 KiB
G
1491
C
1196
B
962
H
679
D
 
566
Other values (76)
3143
ValueCountFrequency (%) 
G 1491 12.9%
 
C 1196 10.3%
 
B 962 8.3%
 
H 679 5.9%
 
D 566 4.9%
 
E 484 4.2%
 
J 278 2.4%
 
F 267 2.3%
 
M 226 2.0%
 
N 209 1.8%
 
Other values (71) 1679 14.5%
 
(Missing) 3536 30.6%
 

Length

Max length7
Mean length1.776030416
Min length1
ValueCountFrequency (%) 
Uppercase_Letter 22 75.9%
 
Decimal_Number 2 6.9%
 
Other_Punctuation 2 6.9%
 
Lowercase_Letter 2 6.9%
 
Space_Separator 1 3.4%
 
ValueCountFrequency (%) 
Latin 24 82.8%
 
Common 5 17.2%
 
ValueCountFrequency (%) 
ASCII 29 100.0%
 

fiModelSeries
Categorical

HIGH CARDINALITY
MISSING
Distinct count59
Unique (%)3.4%
Missing9814
Missing (%)84.8%
Memory size90.5 KiB
II
568
LC
239
III
 
113
-2
 
81
-5
 
70
Other values (54)
688
ValueCountFrequency (%) 
II 568 4.9%
 
LC 239 2.1%
 
III 113 1.0%
 
-2 81 0.7%
 
-5 70 0.6%
 
-6 68 0.6%
 
-1 55 0.5%
 
IV 48 0.4%
 
V 40 0.3%
 
-3 40 0.3%
 
Other values (49) 437 3.8%
 
(Missing) 9814 84.8%
 

Length

Max length8
Mean length2.864771451
Min length1
ValueCountFrequency (%) 
Uppercase_Letter 16 50.0%
 
Decimal_Number 9 28.1%
 
Other_Punctuation 3 9.4%
 
Lowercase_Letter 2 6.2%
 
Math_Symbol 1 3.1%
 
Dash_Punctuation 1 3.1%
 
ValueCountFrequency (%) 
Latin 18 56.2%
 
Common 14 43.8%
 
ValueCountFrequency (%) 
ASCII 32 100.0%
 

fiModelDescriptor
Categorical

HIGH CARDINALITY
MISSING
Distinct count73
Unique (%)2.5%
Missing8676
Missing (%)75.0%
Memory size90.5 KiB
L
589
LGP
585
LC
545
XL
266
CR
 
114
Other values (68)
798
ValueCountFrequency (%) 
L 589 5.1%
 
LGP 585 5.1%
 
LC 545 4.7%
 
XL 266 2.3%
 
CR 114 1.0%
 
LT 111 1.0%
 
6 59 0.5%
 
5 55 0.5%
 
7 45 0.4%
 
E 43 0.4%
 
Other values (63) 485 4.2%
 
(Missing) 8676 75.0%
 

Length

Max length8
Mean length2.738442928
Min length1
ValueCountFrequency (%) 
Uppercase_Letter 24 63.2%
 
Decimal_Number 9 23.7%
 
Lowercase_Letter 2 5.3%
 
Space_Separator 1 2.6%
 
Other_Punctuation 1 2.6%
 
Math_Symbol 1 2.6%
 
ValueCountFrequency (%) 
Latin 26 68.4%
 
Common 12 31.6%
 
ValueCountFrequency (%) 
ASCII 38 100.0%
 

ProductSize
Categorical

MISSING
Distinct count6
Unique (%)0.1%
Missing5830
Missing (%)50.4%
Memory size90.5 KiB
Medium
2068
Large / Medium
1619
Mini
881
Small
563
Large
421
ValueCountFrequency (%) 
Medium 2068 17.9%
 
Large / Medium 1619 14.0%
 
Mini 881 7.6%
 
Small 563 4.9%
 
Large 421 3.6%
 
Compact 191 1.7%
 
(Missing) 5830 50.4%
 

Length

Max length14
Mean length5.387107924
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 14 70.0%
 
Uppercase_Letter 4 20.0%
 
Other_Punctuation 1 5.0%
 
Space_Separator 1 5.0%
 
ValueCountFrequency (%) 
Latin 18 90.0%
 
Common 2 10.0%
 
ValueCountFrequency (%) 
ASCII 20 100.0%
 

fiProductClassDesc
Categorical

HIGH CARDINALITY
Distinct count67
Unique (%)0.6%
Missing0
Missing (%)0.0%
Memory size90.5 KiB
Backhoe Loader - 14.0 to 15.0 Ft Standard Digging Depth
 
1376
Track Type Tractor, Dozer - 85.0 to 105.0 Horsepower
 
430
Wheel Loader - 150.0 to 175.0 Horsepower
 
423
Hydraulic Excavator, Track - 21.0 to 24.0 Metric Tons
 
413
Track Type Tractor, Dozer - 130.0 to 160.0 Horsepower
 
390
Other values (62)
8541
ValueCountFrequency (%) 
Backhoe Loader - 14.0 to 15.0 Ft Standard Digging Depth 1376 11.9%
 
Track Type Tractor, Dozer - 85.0 to 105.0 Horsepower 430 3.7%
 
Wheel Loader - 150.0 to 175.0 Horsepower 423 3.7%
 
Hydraulic Excavator, Track - 21.0 to 24.0 Metric Tons 413 3.6%
 
Track Type Tractor, Dozer - 130.0 to 160.0 Horsepower 390 3.4%
 
Wheel Loader - 120.0 to 135.0 Horsepower 361 3.1%
 
Skid Steer Loader - 1751.0 to 2201.0 Lb Operating Capacity 357 3.1%
 
Track Type Tractor, Dozer - 20.0 to 75.0 Horsepower 343 3.0%
 
Hydraulic Excavator, Track - 33.0 to 40.0 Metric Tons 307 2.7%
 
Hydraulic Excavator, Track - 3.0 to 4.0 Metric Tons 304 2.6%
 
Other values (57) 6869 59.4%
 

Length

Max length58
Mean length49.67182235
Min length27
ValueCountFrequency (%) 
Lowercase_Letter 23 45.1%
 
Uppercase_Letter 13 25.5%
 
Decimal_Number 10 19.6%
 
Other_Punctuation 2 3.9%
 
Math_Symbol 1 2.0%
 
Space_Separator 1 2.0%
 
Dash_Punctuation 1 2.0%
 
ValueCountFrequency (%) 
Latin 36 70.6%
 
Common 15 29.4%
 
ValueCountFrequency (%) 
ASCII 51 100.0%
 

state
Categorical

Distinct count48
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size90.5 KiB
Florida
3376
Texas
1428
California
 
742
Maryland
 
357
Illinois
 
331
Other values (43)
5339
ValueCountFrequency (%) 
Florida 3376 29.2%
 
Texas 1428 12.3%
 
California 742 6.4%
 
Maryland 357 3.1%
 
Illinois 331 2.9%
 
Georgia 324 2.8%
 
Alabama 295 2.5%
 
Pennsylvania 293 2.5%
 
Mississippi 279 2.4%
 
Colorado 275 2.4%
 
Other values (38) 3873 33.5%
 

Length

Max length14
Mean length7.845588871
Min length4
ValueCountFrequency (%) 
Lowercase_Letter 24 52.2%
 
Uppercase_Letter 21 45.7%
 
Space_Separator 1 2.2%
 
ValueCountFrequency (%) 
Latin 45 97.8%
 
Common 1 2.2%
 
ValueCountFrequency (%) 
ASCII 46 100.0%
 

ProductGroup
Categorical

HIGH CORRELATION
Distinct count6
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size90.5 KiB
TEX
3063
WL
2170
TTT
2062
BL
1986
SSL
1523
ValueCountFrequency (%) 
TEX 3063 26.5%
 
WL 2170 18.8%
 
TTT 2062 17.8%
 
BL 1986 17.2%
 
SSL 1523 13.2%
 
MG 769 6.6%
 

Length

Max length3
Mean length2.574440508
Min length2
ValueCountFrequency (%) 
Uppercase_Letter 9 100.0%
 
ValueCountFrequency (%) 
Latin 9 100.0%
 
ValueCountFrequency (%) 
ASCII 9 100.0%
 

ProductGroupDesc
Categorical

HIGH CORRELATION
Distinct count6
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size90.5 KiB
Track Excavators
3063
Wheel Loader
2170
Track Type Tractors
2062
Backhoe Loaders
1986
Skid Steer Loaders
1523
ValueCountFrequency (%) 
Track Excavators 3063 26.5%
 
Wheel Loader 2170 18.8%
 
Track Type Tractors 2062 17.8%
 
Backhoe Loaders 1986 17.2%
 
Skid Steer Loaders 1523 13.2%
 
Motor Graders 769 6.6%
 

Length

Max length19
Mean length15.6767476
Min length12
ValueCountFrequency (%) 
Lowercase_Letter 16 64.0%
 
Uppercase_Letter 8 32.0%
 
Space_Separator 1 4.0%
 
ValueCountFrequency (%) 
Latin 24 96.0%
 
Common 1 4.0%
 
ValueCountFrequency (%) 
ASCII 25 100.0%
 

Drive_System
Categorical

MISSING
Distinct count4
Unique (%)0.1%
Missing8847
Missing (%)76.4%
Memory size90.5 KiB
Two Wheel Drive
1407
No
738
Four Wheel Drive
555
All Wheel Drive
 
26
ValueCountFrequency (%) 
Two Wheel Drive 1407 12.2%
 
No 738 6.4%
 
Four Wheel Drive 555 4.8%
 
All Wheel Drive 26 0.2%
 
(Missing) 8847 76.4%
 

Length

Max length16
Mean length5.045537026
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 11 61.1%
 
Uppercase_Letter 6 33.3%
 
Space_Separator 1 5.6%
 
ValueCountFrequency (%) 
Latin 17 94.4%
 
Common 1 5.6%
 
ValueCountFrequency (%) 
ASCII 18 100.0%
 

Enclosure
Categorical

Distinct count4
Unique (%)< 0.1%
Missing9
Missing (%)0.1%
Memory size90.5 KiB
EROPS w AC
4781
OROPS
4039
EROPS
2743
EROPS AC
 
1
ValueCountFrequency (%) 
EROPS w AC 4781 41.3%
 
OROPS 4039 34.9%
 
EROPS 2743 23.7%
 
EROPS AC 1 < 0.1%
 
(Missing) 9 0.1%
 

Length

Max length10
Mean length7.064287566
Min length3
ValueCountFrequency (%) 
Uppercase_Letter 7 63.6%
 
Lowercase_Letter 3 27.3%
 
Space_Separator 1 9.1%
 
ValueCountFrequency (%) 
Latin 10 90.9%
 
Common 1 9.1%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

Forks
Categorical

MISSING
Distinct count2
Unique (%)< 0.1%
Missing5935
Missing (%)51.3%
Memory size90.5 KiB
None or Unspecified
4761
Yes
 
877
ValueCountFrequency (%) 
None or Unspecified 4761 41.1%
 
Yes 877 7.6%
 
(Missing) 5935 51.3%
 

Length

Max length19
Mean length9.58221723
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Pad_Type
Categorical

MISSING
Distinct count4
Unique (%)0.2%
Missing9611
Missing (%)83.0%
Memory size90.5 KiB
None or Unspecified
1781
Reversible
 
118
Street
 
62
Grouser
 
1
ValueCountFrequency (%) 
None or Unspecified 1781 15.4%
 
Reversible 118 1.0%
 
Street 62 0.5%
 
Grouser 1 < 0.1%
 
(Missing) 9611 83.0%
 

Length

Max length19
Mean length5.550073447
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 16 72.7%
 
Uppercase_Letter 5 22.7%
 
Space_Separator 1 4.5%
 
ValueCountFrequency (%) 
Latin 21 95.5%
 
Common 1 4.5%
 
ValueCountFrequency (%) 
ASCII 22 100.0%
 

Ride_Control
Categorical

MISSING
Distinct count3
Unique (%)0.1%
Missing7451
Missing (%)64.4%
Memory size90.5 KiB
No
1704
None or Unspecified
1577
Yes
841
ValueCountFrequency (%) 
No 1704 14.7%
 
None or Unspecified 1577 13.6%
 
Yes 841 7.3%
 
(Missing) 7451 64.4%
 

Length

Max length19
Mean length5.033007863
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Stick
Categorical

MISSING
Distinct count2
Unique (%)0.1%
Missing9611
Missing (%)83.0%
Memory size90.5 KiB
Standard
1025
Extended
937
ValueCountFrequency (%) 
Standard 1025 8.9%
 
Extended 937 8.1%
 
(Missing) 9611 83.0%
 

Length

Max length8
Mean length3.847662663
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 7 77.8%
 
Uppercase_Letter 2 22.2%
 
ValueCountFrequency (%) 
Latin 9 100.0%
 
ValueCountFrequency (%) 
ASCII 9 100.0%
 

Transmission
Categorical

MISSING
Distinct count7
Unique (%)0.1%
Missing6796
Missing (%)58.7%
Memory size90.5 KiB
Standard
3587
None or Unspecified
742
Powershift
 
260
Hydrostatic
 
138
Powershuttle
 
42
Other values (2)
 
8
ValueCountFrequency (%) 
Standard 3587 31.0%
 
None or Unspecified 742 6.4%
 
Powershift 260 2.2%
 
Hydrostatic 138 1.2%
 
Powershuttle 42 0.4%
 
Autoshift 4 < 0.1%
 
Direct Drive 4 < 0.1%
 
(Missing) 6796 58.7%
 

Length

Max length19
Mean length5.866067571
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 18 69.2%
 
Uppercase_Letter 7 26.9%
 
Space_Separator 1 3.8%
 
ValueCountFrequency (%) 
Latin 25 96.2%
 
Common 1 3.8%
 
ValueCountFrequency (%) 
ASCII 26 100.0%
 

Turbocharged
Categorical

MISSING
Distinct count2
Unique (%)0.1%
Missing9611
Missing (%)83.0%
Memory size90.5 KiB
None or Unspecified
1900
Yes
 
62
ValueCountFrequency (%) 
None or Unspecified 1900 16.4%
 
Yes 62 0.5%
 
(Missing) 9611 83.0%
 

Length

Max length19
Mean length5.626803767
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Blade_Extension
Categorical

MISSING
Distinct count2
Unique (%)0.3%
Missing10809
Missing (%)93.4%
Memory size90.5 KiB
None or Unspecified
714
Yes
 
50
ValueCountFrequency (%) 
None or Unspecified 714 6.2%
 
Yes 50 0.4%
 
(Missing) 10809 93.4%
 

Length

Max length19
Mean length3.987125205
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Blade_Width
Categorical

MISSING
Distinct count6
Unique (%)0.8%
Missing10809
Missing (%)93.4%
Memory size90.5 KiB
14'
252
None or Unspecified
238
12'
233
16'
 
27
<12'
 
8
ValueCountFrequency (%) 
14' 252 2.2%
 
None or Unspecified 238 2.1%
 
12' 233 2.0%
 
16' 27 0.2%
 
<12' 8 0.1%
 
13' 6 0.1%
 
(Missing) 10809 93.4%
 

Length

Max length19
Mean length3.329732999
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 52.4%
 
Decimal_Number 5 23.8%
 
Uppercase_Letter 2 9.5%
 
Other_Punctuation 1 4.8%
 
Space_Separator 1 4.8%
 
Math_Symbol 1 4.8%
 
ValueCountFrequency (%) 
Latin 13 61.9%
 
Common 8 38.1%
 
ValueCountFrequency (%) 
ASCII 21 100.0%
 

Enclosure_Type
Categorical

MISSING
Distinct count3
Unique (%)0.4%
Missing10809
Missing (%)93.4%
Memory size90.5 KiB
None or Unspecified
546
Low Profile
165
High Profile
 
53
ValueCountFrequency (%) 
None or Unspecified 546 4.7%
 
Low Profile 165 1.4%
 
High Profile 53 0.5%
 
(Missing) 10809 93.4%
 

Length

Max length19
Mean length3.910135661
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 15 71.4%
 
Uppercase_Letter 5 23.8%
 
Space_Separator 1 4.8%
 
ValueCountFrequency (%) 
Latin 20 95.2%
 
Common 1 4.8%
 
ValueCountFrequency (%) 
ASCII 21 100.0%
 

Engine_Horsepower
Categorical

MISSING
Distinct count2
Unique (%)0.3%
Missing10809
Missing (%)93.4%
Memory size90.5 KiB
No
705
Variable
 
59
ValueCountFrequency (%) 
No 705 6.1%
 
Variable 59 0.5%
 
(Missing) 10809 93.4%
 

Length

Max length8
Mean length2.964572712
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 8 80.0%
 
Uppercase_Letter 2 20.0%
 
ValueCountFrequency (%) 
Latin 10 100.0%
 
ValueCountFrequency (%) 
ASCII 10 100.0%
 

Hydraulics
Categorical

MISSING
Distinct count11
Unique (%)0.1%
Missing2010
Missing (%)17.4%
Memory size90.5 KiB
2 Valve
3913
Auxiliary
2487
Standard
2092
Base + 1 Function
 
741
3 Valve
 
185
Other values (6)
 
145
ValueCountFrequency (%) 
2 Valve 3913 33.8%
 
Auxiliary 2487 21.5%
 
Standard 2092 18.1%
 
Base + 1 Function 741 6.4%
 
3 Valve 185 1.6%
 
4 Valve 117 1.0%
 
Base + 3 Function 12 0.1%
 
Base + 4 Function 5 < 0.1%
 
Base + 2 Function 5 < 0.1%
 
Base + 5 Function 5 < 0.1%
 
(Missing) 2010 17.4%
 

Length

Max length17
Mean length7.580316253
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 15 53.6%
 
Decimal_Number 6 21.4%
 
Uppercase_Letter 5 17.9%
 
Space_Separator 1 3.6%
 
Math_Symbol 1 3.6%
 
ValueCountFrequency (%) 
Latin 20 71.4%
 
Common 8 28.6%
 
ValueCountFrequency (%) 
ASCII 28 100.0%
 

Pushblock
Categorical

MISSING
Distinct count2
Unique (%)0.3%
Missing10809
Missing (%)93.4%
Memory size90.5 KiB
None or Unspecified
554
Yes
210
ValueCountFrequency (%) 
None or Unspecified 554 4.8%
 
Yes 210 1.8%
 
(Missing) 10809 93.4%
 

Length

Max length19
Mean length3.765920677
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Ripper
Categorical

MISSING
Distinct count4
Unique (%)0.1%
Missing8765
Missing (%)75.7%
Memory size90.5 KiB
None or Unspecified
1953
Multi Shank
438
Yes
 
283
Single Shank
 
134
ValueCountFrequency (%) 
None or Unspecified 1953 16.9%
 
Multi Shank 438 3.8%
 
Yes 283 2.4%
 
Single Shank 134 1.2%
 
(Missing) 8765 75.7%
 

Length

Max length19
Mean length6.107059535
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 17 73.9%
 
Uppercase_Letter 5 21.7%
 
Space_Separator 1 4.3%
 
ValueCountFrequency (%) 
Latin 22 95.7%
 
Common 1 4.3%
 
ValueCountFrequency (%) 
ASCII 23 100.0%
 

Scarifier
Categorical

MISSING
Distinct count2
Unique (%)0.3%
Missing10809
Missing (%)93.4%
Memory size90.5 KiB
Yes
450
None or Unspecified
314
ValueCountFrequency (%) 
Yes 450 3.9%
 
None or Unspecified 314 2.7%
 
(Missing) 10809 93.4%
 

Length

Max length19
Mean length3.434113886
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Tip_Control
Categorical

MISSING
Distinct count3
Unique (%)0.4%
Missing10809
Missing (%)93.4%
Memory size90.5 KiB
None or Unspecified
625
Sideshift & Tip
 
94
Tip
 
45
ValueCountFrequency (%) 
None or Unspecified 625 5.4%
 
Sideshift & Tip 94 0.8%
 
Tip 45 0.4%
 
(Missing) 10809 93.4%
 

Length

Max length19
Mean length3.961548432
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 13 68.4%
 
Uppercase_Letter 4 21.1%
 
Other_Punctuation 1 5.3%
 
Space_Separator 1 5.3%
 
ValueCountFrequency (%) 
Latin 17 89.5%
 
Common 2 10.5%
 
ValueCountFrequency (%) 
ASCII 19 100.0%
 

Tire_Size
Categorical

MISSING
Distinct count15
Unique (%)0.5%
Missing8653
Missing (%)74.8%
Memory size90.5 KiB
None or Unspecified
1484
20.5
531
14"
298
23.5
 
280
26.5
 
153
Other values (10)
 
174
ValueCountFrequency (%) 
None or Unspecified 1484 12.8%
 
20.5 531 4.6%
 
14" 298 2.6%
 
23.5 280 2.4%
 
26.5 153 1.3%
 
29.5 63 0.5%
 
17.5 39 0.3%
 
17.5" 22 0.2%
 
20.5" 19 0.2%
 
13" 10 0.1%
 
Other values (5) 21 0.2%
 
(Missing) 8653 74.8%
 

Length

Max length19
Mean length5.153719865
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 44.0%
 
Decimal_Number 9 36.0%
 
Uppercase_Letter 2 8.0%
 
Other_Punctuation 2 8.0%
 
Space_Separator 1 4.0%
 
ValueCountFrequency (%) 
Latin 13 52.0%
 
Common 12 48.0%
 
ValueCountFrequency (%) 
ASCII 25 100.0%
 

Coupler
Categorical

MISSING
Distinct count3
Unique (%)< 0.1%
Missing4846
Missing (%)41.9%
Memory size90.5 KiB
None or Unspecified
5867
Manual
 
617
Hydraulic
 
243
ValueCountFrequency (%) 
None or Unspecified 5867 50.7%
 
Manual 617 5.3%
 
Hydraulic 243 2.1%
 
(Missing) 4846 41.9%
 

Length

Max length19
Mean length11.39721766
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 14 73.7%
 
Uppercase_Letter 4 21.1%
 
Space_Separator 1 5.3%
 
ValueCountFrequency (%) 
Latin 18 94.7%
 
Common 1 5.3%
 
ValueCountFrequency (%) 
ASCII 19 100.0%
 

Coupler_System
Categorical

MISSING
Distinct count2
Unique (%)0.1%
Missing10057
Missing (%)86.9%
Memory size90.5 KiB
None or Unspecified
1297
Yes
 
219
ValueCountFrequency (%) 
None or Unspecified 1297 11.2%
 
Yes 219 1.9%
 
(Missing) 10057 86.9%
 

Length

Max length19
Mean length4.793139203
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Grouser_Tracks
Categorical

MISSING
Distinct count2
Unique (%)0.1%
Missing10060
Missing (%)86.9%
Memory size90.5 KiB
None or Unspecified
1305
Yes
 
208
ValueCountFrequency (%) 
None or Unspecified 1305 11.3%
 
Yes 208 1.8%
 
(Missing) 10060 86.9%
 

Length

Max length19
Mean length4.80419943
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Hydraulics_Flow
Categorical

MISSING
Distinct count3
Unique (%)0.2%
Missing10060
Missing (%)86.9%
Memory size90.5 KiB
Standard
1467
High Flow
 
44
None or Unspecified
 
2
ValueCountFrequency (%) 
Standard 1467 12.7%
 
High Flow 44 0.4%
 
None or Unspecified 2 < 0.1%
 
(Missing) 10060 86.9%
 

Length

Max length19
Mean length3.65937959
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 16 72.7%
 
Uppercase_Letter 5 22.7%
 
Space_Separator 1 4.5%
 
ValueCountFrequency (%) 
Latin 21 95.5%
 
Common 1 4.5%
 
ValueCountFrequency (%) 
ASCII 22 100.0%
 

Track_Type
Categorical

MISSING
Distinct count2
Unique (%)0.1%
Missing8533
Missing (%)73.7%
Memory size90.5 KiB
Steel
2583
Rubber
 
457
ValueCountFrequency (%) 
Steel 2583 22.3%
 
Rubber 457 3.9%
 
(Missing) 8533 73.7%
 

Length

Max length6
Mean length3.564849218
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 8 80.0%
 
Uppercase_Letter 2 20.0%
 
ValueCountFrequency (%) 
Latin 10 100.0%
 
ValueCountFrequency (%) 
ASCII 10 100.0%
 

Undercarriage_Pad_Width
Categorical

MISSING
Distinct count16
Unique (%)0.5%
Missing8529
Missing (%)73.7%
Memory size90.5 KiB
None or Unspecified
2793
32 inch
 
74
28 inch
 
38
24 inch
 
36
36 inch
 
25
Other values (11)
 
78
ValueCountFrequency (%) 
None or Unspecified 2793 24.1%
 
32 inch 74 0.6%
 
28 inch 38 0.3%
 
24 inch 36 0.3%
 
36 inch 25 0.2%
 
16 inch 21 0.2%
 
20 inch 12 0.1%
 
30 inch 10 0.1%
 
18 inch 10 0.1%
 
34 inch 9 0.1%
 
Other values (6) 16 0.1%
 
(Missing) 8529 73.7%
 

Length

Max length19
Mean length6.948155189
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 12 52.2%
 
Decimal_Number 8 34.8%
 
Uppercase_Letter 2 8.7%
 
Space_Separator 1 4.3%
 
ValueCountFrequency (%) 
Latin 14 60.9%
 
Common 9 39.1%
 
ValueCountFrequency (%) 
ASCII 23 100.0%
 

Stick_Length
Categorical

MISSING
Distinct count21
Unique (%)0.7%
Missing8530
Missing (%)73.7%
Memory size90.5 KiB
None or Unspecified
2719
9' 6"
 
67
10' 6"
 
63
9' 8"
 
28
11' 0"
 
24
Other values (16)
 
142
ValueCountFrequency (%) 
None or Unspecified 2719 23.5%
 
9' 6" 67 0.6%
 
10' 6" 63 0.5%
 
9' 8" 28 0.2%
 
11' 0" 24 0.2%
 
9' 10" 24 0.2%
 
9' 7" 22 0.2%
 
10' 2" 21 0.2%
 
12' 10" 21 0.2%
 
10' 10" 10 0.1%
 
Other values (11) 44 0.4%
 
(Missing) 8530 73.7%
 

Length

Max length19
Mean length6.834787868
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 42.3%
 
Decimal_Number 10 38.5%
 
Uppercase_Letter 2 7.7%
 
Other_Punctuation 2 7.7%
 
Space_Separator 1 3.8%
 
ValueCountFrequency (%) 
Common 13 50.0%
 
Latin 13 50.0%
 
ValueCountFrequency (%) 
ASCII 26 100.0%
 

Thumb
Categorical

MISSING
Distinct count3
Unique (%)0.1%
Missing8529
Missing (%)73.7%
Memory size90.5 KiB
None or Unspecified
1981
Hydraulic
743
Manual
 
320
ValueCountFrequency (%) 
None or Unspecified 1981 17.1%
 
Hydraulic 743 6.4%
 
Manual 320 2.8%
 
(Missing) 8529 73.7%
 

Length

Max length19
Mean length6.206947205
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 14 73.7%
 
Uppercase_Letter 4 21.1%
 
Space_Separator 1 5.3%
 
ValueCountFrequency (%) 
Latin 18 94.7%
 
Common 1 5.3%
 
ValueCountFrequency (%) 
ASCII 19 100.0%
 

Pattern_Changer
Categorical

MISSING
Distinct count3
Unique (%)0.1%
Missing8530
Missing (%)73.7%
Memory size90.5 KiB
None or Unspecified
2669
Yes
 
371
No
 
3
ValueCountFrequency (%) 
None or Unspecified 2669 23.1%
 
Yes 371 3.2%
 
No 3 < 0.1%
 
(Missing) 8530 73.7%
 

Length

Max length19
Mean length6.689708805
Min length2
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Grouser_Type
Categorical

MISSING
Distinct count2
Unique (%)0.1%
Missing8533
Missing (%)73.7%
Memory size90.5 KiB
Double
2345
Triple
695
ValueCountFrequency (%) 
Double 2345 20.3%
 
Triple 695 6.0%
 
(Missing) 8533 73.7%
 

Length

Max length6
Mean length3.78804113
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 10 83.3%
 
Uppercase_Letter 2 16.7%
 
ValueCountFrequency (%) 
Latin 12 100.0%
 
ValueCountFrequency (%) 
ASCII 12 100.0%
 

Backhoe_Mounting
Categorical

MISSING
Distinct count1
Unique (%)< 0.1%
Missing9533
Missing (%)82.4%
Memory size90.5 KiB
None or Unspecified
2040
ValueCountFrequency (%) 
None or Unspecified 2040 17.6%
 
(Missing) 9533 82.4%
 

Length

Max length19
Mean length5.820357729
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 78.6%
 
Uppercase_Letter 2 14.3%
 
Space_Separator 1 7.1%
 
ValueCountFrequency (%) 
Latin 13 92.9%
 
Common 1 7.1%
 
ValueCountFrequency (%) 
ASCII 14 100.0%
 

Blade_Type
Categorical

MISSING
Distinct count8
Unique (%)0.4%
Missing9531
Missing (%)82.4%
Memory size90.5 KiB
PAT
1021
None or Unspecified
410
Semi U
290
Straight
 
138
VPAT
 
134
Other values (3)
 
49
ValueCountFrequency (%) 
PAT 1021 8.8%
 
None or Unspecified 410 3.5%
 
Semi U 290 2.5%
 
Straight 138 1.2%
 
VPAT 134 1.2%
 
U 26 0.2%
 
Angle 22 0.2%
 
Landfill 1 < 0.1%
 
(Missing) 9531 82.4%
 

Length

Max length19
Mean length3.712952562
Min length1
ValueCountFrequency (%) 
Lowercase_Letter 16 64.0%
 
Uppercase_Letter 8 32.0%
 
Space_Separator 1 4.0%
 
ValueCountFrequency (%) 
Latin 24 96.0%
 
Common 1 4.0%
 
ValueCountFrequency (%) 
ASCII 25 100.0%
 

Travel_Controls
Categorical

MISSING
Distinct count7
Unique (%)0.3%
Missing9530
Missing (%)82.3%
Memory size90.5 KiB
None or Unspecified
1524
Differential Steer
378
Finger Tip
 
69
Lever
 
62
Pedal
 
7
Other values (2)
 
3
ValueCountFrequency (%) 
None or Unspecified 1524 13.2%
 
Differential Steer 378 3.3%
 
Finger Tip 69 0.6%
 
Lever 62 0.5%
 
Pedal 7 0.1%
 
2 Pedal 2 < 0.1%
 
1 Speed 1 < 0.1%
 
(Missing) 9530 82.3%
 

Length

Max length19
Mean length5.651602869
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 15 57.7%
 
Uppercase_Letter 8 30.8%
 
Decimal_Number 2 7.7%
 
Space_Separator 1 3.8%
 
ValueCountFrequency (%) 
Latin 23 88.5%
 
Common 3 11.5%
 
ValueCountFrequency (%) 
ASCII 26 100.0%
 

Differential_Type
Categorical

MISSING
Distinct count3
Unique (%)0.1%
Missing9420
Missing (%)81.4%
Memory size90.5 KiB
Standard
2096
Limited Slip
 
51
No Spin
 
6
ValueCountFrequency (%) 
Standard 2096 18.1%
 
Limited Slip 51 0.4%
 
No Spin 6 0.1%
 
(Missing) 9420 81.4%
 

Length

Max length12
Mean length3.947291109
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 11 73.3%
 
Uppercase_Letter 3 20.0%
 
Space_Separator 1 6.7%
 
ValueCountFrequency (%) 
Latin 14 93.3%
 
Common 1 6.7%
 
ValueCountFrequency (%) 
ASCII 15 100.0%
 

Steering_Controls
Categorical

MISSING
Distinct count3
Unique (%)0.1%
Missing9420
Missing (%)81.4%
Memory size90.5 KiB
Conventional
2095
Command Control
 
57
Four Wheel Standard
 
1
ValueCountFrequency (%) 
Conventional 2095 18.1%
 
Command Control 57 0.5%
 
Four Wheel Standard 1 < 0.1%
 
(Missing) 9420 81.4%
 

Length

Max length19
Mean length4.689708805
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 13 72.2%
 
Uppercase_Letter 4 22.2%
 
Space_Separator 1 5.6%
 
ValueCountFrequency (%) 
Latin 17 94.4%
 
Common 1 5.6%
 
ValueCountFrequency (%) 
ASCII 18 100.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

SalesIDMachineIDModelIDdatasourceauctioneerIDYearMadeMachineHoursCurrentMeterUsageBandsaledatefiModelDescfiBaseModelfiSecondaryDescfiModelSeriesfiModelDescriptorProductSizefiProductClassDescstateProductGroupProductGroupDescDrive_SystemEnclosureForksPad_TypeRide_ControlStickTransmissionTurbochargedBlade_ExtensionBlade_WidthEnclosure_TypeEngine_HorsepowerHydraulicsPushblockRipperScarifierTip_ControlTire_SizeCouplerCoupler_SystemGrouser_TracksHydraulics_FlowTrack_TypeUndercarriage_Pad_WidthStick_LengthThumbPattern_ChangerGrouser_TypeBackhoe_MountingBlade_TypeTravel_ControlsDifferential_TypeSteering_Controls
012228379028591376121310000.0NaN1/5/2012 0:00375L375NaNNaNLLarge / MediumHydraulic Excavator, Track - 66.0 to 90.0 Metric TonsKentuckyTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNSteelNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11222839104832036526121320064412.0Medium1/5/2012 0:00TX300LC2TX300LC2NaNLarge / MediumHydraulic Excavator, Track - 28.0 to 33.0 Metric TonsConnecticutTEXTrack ExcavatorsNaNEROPS w ACNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNHydraulicNaNNaNNaNSteelNone or Unspecified12' 4"None or UnspecifiedYesDoubleNaNNaNNaNNaNNaN
2122284199930845871213200010127.0Medium1/5/2012 0:00270LC270NaNNaNLCLarge / MediumHydraulic Excavator, Track - 24.0 to 28.0 Metric TonsConnecticutTEXTrack ExcavatorsNaNEROPS w ACNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNSteelNone or Unspecified12' 4"None or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
3122284310624251954121310004682.0Low1/5/2012 0:00892DLC892DNaNLCLarge / MediumHydraulic Excavator, Track - 28.0 to 33.0 Metric TonsConnecticutTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNSteelNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
4122284510328414701121320028150.0Medium1/4/2012 0:00544H544HNaNNaNNaNWheel Loader - 120.0 to 135.0 HorsepowerFloridaWLWheel LoaderNaNEROPS w ACNone or UnspecifiedNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaN20.5ManualNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional
512228475307907019121320010.0NaN1/5/2012 0:00246246NaNNaNNaNNaNSkid Steer Loader - 1751.0 to 2201.0 Lb Operating CapacityFloridaSSLSkid Steer LoadersNaNOROPSNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNManualYesNone or UnspecifiedStandardNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
6122284910405203854121310001529.0Low1/5/2012 0:00966C966CNaNNaNMediumWheel Loader - 150.0 to 175.0 HorsepowerIllinoisWLWheel LoaderNaNEROPSNone or UnspecifiedNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaN23.5None or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional
7122285010617303854121310003998.0Low1/5/2012 0:00966C966CNaNNaNMediumWheel Loader - 150.0 to 175.0 HorsepowerIllinoisWLWheel LoaderNaNEROPSNone or UnspecifiedNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaN2 ValveNaNNaNNaNNaN23.5None or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardConventional
8122285553139323926121310008145.0Low1/4/2012 0:0012HNA12HNaNNaNNaNMotorgrader - 130.0 to 145.0 HorsepowerFloridaMGMotor GradersNoOROPSNaNNaNNaNNaNNone or UnspecifiedNaNNone or Unspecified14'High ProfileNoBase + 1 FunctionNone or UnspecifiedNone or UnspecifiedYesNone or Unspecified17.5"NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
912228633172874106121320023211.0Low1/5/2012 0:00D4GD4GNaNNaNNaNTrack Type Tractor, Dozer - 75.0 to 85.0 HorsepowerWest VirginiaTTTTrack Type TractorsNaNOROPSNaNNaNNaNNaNHydrostaticNaNNaNNaNNaNNaN2 ValveNaNNone or UnspecifiedNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNone or UnspecifiedPATNone or UnspecifiedNaNNaN

Last rows

SalesIDMachineIDModelIDdatasourceauctioneerIDYearMadeMachineHoursCurrentMeterUsageBandsaledatefiModelDescfiBaseModelfiSecondaryDescfiModelSeriesfiModelDescriptorProductSizefiProductClassDescstateProductGroupProductGroupDescDrive_SystemEnclosureForksPad_TypeRide_ControlStickTransmissionTurbochargedBlade_ExtensionBlade_WidthEnclosure_TypeEngine_HorsepowerHydraulicsPushblockRipperScarifierTip_ControlTire_SizeCouplerCoupler_SystemGrouser_TracksHydraulics_FlowTrack_TypeUndercarriage_Pad_WidthStick_LengthThumbPattern_ChangerGrouser_TypeBackhoe_MountingBlade_TypeTravel_ControlsDifferential_TypeSteering_Controls
11563633330518002592143714912006NaNNaN2/13/2012 0:0035N35NNaNNaNMiniHydraulic Excavator, Track - 3.0 to 4.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNSteelNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11564633331419081622143714922006NaNNaN1/28/2012 0:0035N35NNaNNaNMiniHydraulic Excavator, Track - 3.0 to 4.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNRubberNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11565633333018799232144614922006NaNNaN1/28/2012 0:0055N255N2NaNMiniHydraulic Excavator, Track - 5.0 to 6.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNRubberNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11566633333918568452143514922005NaNNaN1/28/2012 0:0030NX30NXNaNNaNMiniHydraulic Excavator, Track - 2.0 to 3.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNRubberNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11567633334317996142143514912005NaNNaN2/13/2012 0:0030NX30NXNaNNaNMiniHydraulic Excavator, Track - 2.0 to 3.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNSteelNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11568633334419192012143514922005NaNNaN3/7/2012 0:0030NX30NXNaNNaNMiniHydraulic Excavator, Track - 2.0 to 3.0 Metric TonsTexasTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNStandardNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNSteelNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11569633334518821222143614922005NaNNaN1/28/2012 0:0030NX230NX2NaNMiniHydraulic Excavator, Track - 3.0 to 4.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNSteelNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11570633334719442132143514922005NaNNaN1/28/2012 0:0030NX30NXNaNNaNMiniHydraulic Excavator, Track - 2.0 to 3.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNRubberNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11571633334817945182143514922006NaNNaN3/7/2012 0:0030NX30NXNaNNaNMiniHydraulic Excavator, Track - 2.0 to 3.0 Metric TonsTexasTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNRubberNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN
11572633334919447432143614922006NaNNaN1/28/2012 0:0030NX230NX2NaNMiniHydraulic Excavator, Track - 3.0 to 4.0 Metric TonsFloridaTEXTrack ExcavatorsNaNEROPSNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNAuxiliaryNaNNaNNaNNaNNaNNone or UnspecifiedNaNNaNNaNRubberNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedNone or UnspecifiedDoubleNaNNaNNaNNaNNaN